Integrating Correction into Incremental Validation

نویسندگان

  • Béatrice Bouchou-Markhoff
  • Ahmed Cheriat
  • Mirian Halfeld Ferrari Alves
  • Agata Savary
چکیده

Many data on the Web are XML documents. An XML document is an unranked labelled tree. A schema for XML documents (for instance a DTD) is the specification of their internal structure: a schema is a tree grammar, and validating a document w.r.t. a schema is done by a running of a tree automaton. Given a document, valid w.r.t. a DTD, and a sequence of updates (insertions, deletions and replacements of subtrees), we first recall how we incrementally check the validity of the resulting document. Indeed, updating a valid document requires re-checking the parts of the document concerned by the updates. Next, the core of the paper is a method to correct subtrees for which the re-validation fails: if the validator fails at node p, a correction routine is called in order to compute corrections of the subtree rooted at p, within a given threshold. Then re-validation continues. When the tree traversal is completed (i.e. all updates have been considered), the corrections generated by each call to the routine are merged, and different correction versions for the resulting document are proposed to the user. The correction routine uses tree edit distance matrices.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data Mining , Validation , and Collaborative Knowledge Capture

For large-scale data mining, utilizing data from ubiquitous and mixed-structured data sources, the extraction and integration into a comprehensive data-warehouse is usually of prime importance. Then, appropriate methods for validation and potential refinement are essential. This chapter describes an approach for integrating data mining, information extraction, and validation with collaborative ...

متن کامل

L2 Writing Feedback Preferences and Their Relationships with Entity vs. Incremental Mindsets of EFL Learners

The present study was aimed at investigating intermediate Iranian EFL learners’ feedback preferences on their L2 writing and examining the possible differences between learners with entity and incremental language mindsets with respect to their feedback preferences. To this end, 150 EFL learners were recruited from several language institutes in Isfahan, Iran, and their language proficiency lev...

متن کامل

Propagation of Crack in Linear Elastic Materials with Considering Crack Path Correction Factor

Modeling of crack propagation by a finite element method under mixed mode conditions is of prime importance in the fracture mechanics. This article describes an application of finite element method to the analysis of mixed mode crack growth in linear elastic fracture mechanics. Crack - growth process is simulated by an incremental crack-extension analysis based on the maximum principal stress c...

متن کامل

Integrating Linguistic Information from Multiple Sources in Lexicon Development and

In this paper, two related spoken language-oriented projects are presented. Both projects deal with integrating linguistic information from multiple sources. The first project described is the development of a multi-purpose central lexicon database including phonemic representations. Special emphasis is put on central availability and facilitating incremental development. The second project des...

متن کامل

Integrating surprisal and uncertain-input models in online sentence comprehension: formal techniques and empirical results

A system making optimal use of available information in incremental language comprehension might be expected to use linguistic knowledge together with current input to revise beliefs about previous input. Under some circumstances, such an error-correction capability might induce comprehenders to adopt grammatical analyses that are inconsistent with the true input. Here we present a formal model...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006